Probabilistic Parsing of Unrestricted English Text, With a I-Iighly-Detailed Grammar
نویسندگان
چکیده
Ezra Black, S t e p h e n Eubank, Hidek i Kashioka A T R I n t e r p r e t i n g T e l e c o m m u n i c a t i o n s Labora to r i e s 2-2 Hikar idai Seilm-cho, So raku-gun Kyoto, Japan 619-02 black@air, i~l. co. jp kashioka@a~r, itl. co. jp eub~-~@atr, itl. c o . jp David M a g e r m a n Renaissance Technologies Corp . 25 East Loop Road, Suite 211 Stony Brook , IVY 11776 U S A magermau~rencec, corn
منابع مشابه
Three studies of grammar-based surface-syntactic parsing of unrestricted English text. A summary and orientation
Three studies of grammar-based surface parsing of unrestricted English text Voutilainen, Atro Tapio University of Helsinki, SF The dissertation addresses the design of parsing grammars for automatic surfacesyntactic analysis of unconstrained English text. It consists of a summary and three articles. Morphological disambiguation documents a grammar for morphological (or part-ofspeech) disambigua...
متن کاملThree Studies of Grammar-based Surface Parsing of Unrestricted English Text
Three studies of grammar-based surface parsing of unrestricted English text Voutilainen, Atro Tapio University of Helsinki, SF The dissertation addresses the design of parsing grammars for automatic surfacesyntactic analysis of unconstrained English text. It consists of a summary and three articles. Morphological disambiguation documents a grammar for morphological (or part-ofspeech) disambigua...
متن کاملDeveloping and Evaluating a Probabilistic LR Parser of Part-of-Speech and Punctuation Labels
We describe an approach to robust domain-independent syntactic parsing of unrestricted naturally-occurring (English) input. The technique involves parsing sequences of part-ofspeech and punctuation labels using a unification-based grammar coupled with a probabilistic LR parser. We describe the coverage of several corpora using this grammar and report the results of a parsing experiment using pr...
متن کاملA Richly Annotated Corpus for Probabilisfic Parsing
This paper describes the use of a small but syntactically rich parsed corpus of English in probabilistic parsing. Software has been developed to extract probabilistic systemic-f~nctional grammars (SFGs) from the Polytechnic of Wales Corpus in several formalisms, which could equally well be applied to other parsed corpora. To complement the large probabilistic grammar, we discuss progress in the...
متن کاملRobust German Noun Chunking With a Probabilistic Context-Free Grammar
We present a noun chunker for German which is based on a head-lexicalised probabilistic contextfree grammar. A manually developed grammar was semi-automatically extended with robustness rules in order to allow parsing of unrestricted text. The model parameters were learned from unlabelled training data by a probabilistic context-free parser. For extracting noun chunks, the parser generates all ...
متن کامل